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LEUCINE-RICH REPEATS (LRRs) were 
first discovered in leucine-rich (^-glyco- 
protein, a protein of unknown function 
from human serum 1 . LRR-containing 
proteins now represent a diverse group 
of molecules with differing functions 
and cellular locations in a variety of 
organisms (Tabie 1, Fig. 1). LRRs are 
usually present in tandem, and the num- 
ber of LRR motifs ranges from one, in 
platelet glycoprotein Ibp, to 30, in ' 
chaoptin. The most common length of 
an LRR is 24 residues, but repeats con- 
taining any number between 20 and 29 
residues are found. The recent X-ray 
structure analysis of ribonuclease 
inhibitor (RI) from pig liver 2 (Fig. 2), a 
protein built entirely from LRRs, pro- 
vided the first picture of the three- 
dimensional architecture of LRRs. 

Consensus sequences of LRRs and 
evolutionary Implications 

As the name implies, LRRs are distin- 
guished by a consensus sequence con- 
sisting predominantly of leucines (Table 
I). LRRs are easily overlooked by the 
commonly used methods for sequence 
comparison because they contain gaps, 
vary in length and amino acid compo- 
sition, and therefore have a degener- 
ate consensus sequence. Consequently, 
LRRs in many proteins were not 
detected for some time. 

The consensus sequence compiled 
from all known LRR-containing proteins 
contains leucines or other aliphatic 
residues at positions 2, 5, 7, 12, 16, 21 
and 24, and asparagine, cysteine or thre- 
onine at position 10 (Table 1). Most pro- 
teins contain exclusively asparagine at 
position 10, but three have exclusively 
cysteine in this position (Table I). The 
functional and evolutionary significance 
of the variations of consensus residues 
at position 10 is not known. 

Consensus sequences derived from 
LRRs in individual proteins often contain 
additional conserved residues in pos- 
itions other than those mentioned above; 
mostly these are aliphatic and aromatic 
amino acids; sometimes glycines and 
prolines, but seldom other amino acids. 
The hydrophobic consensus residues in 
the carboxy-terminal parts of the repeats 
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The leucine-rich repeat: a 
versatile binding motif 

Bostjan Kobe and Johann Deisenhofer 

Leucine-rich repeats are short sequence motifs present in a number of 
proteins with diverse functions and cellular locations. All proteins contain- 
ing these repeats are thought to be involved In protein-protein inter- 
actions. The crystal structure of ribonuclease inhibitor protein has 
revealed that leucine-rich repeats correspond to p~ct structural units. 
These units are arranged so that they form a parallel p-sheet with one sur- 
face exposed to solvent, so that the protein acquires an unusual, non- 
globular shape. These two features may be 'responsible for the protein- 
binding functions of proteins containing leucine-rich repeats. 



are commonly spaced by 3, 4 or 7 
residues. 

About half of the genes encoding the 
known LRR-containing proteins have 
been sequenced. Gonadotrophs recep- 
tor genes contain several similarly sized 
exons in their LRR domains that are 
homologous in their exon-intron junc- 
tion sequences 3 , suggesting that the LRR 
domain evolved by exon duplication 
and shuffling from a single prototypic 
exon corresponding to one LRR. Such 
an evolutionary relationship is less obvi- 
ous in genes encoding other LRR- 
containing proteins, but the introns often 
localize at similar positions in the 
repeats (most frequently between pos- 
itions I and 6, less frequently between 
positions II and 18, and rarely else- 
where). The most likely mechanism of 
evolution of LRR-containing proteins 
would therefore comprise unequal 
crossovers and duplications of gene 
fragments corresponding to prototypic 
leucine-rich building blocks. Two evol- 
utionary scenarios are possible: (1) an 
ancestral LRR module evolved first, and 
duplicated to give rise to individual 
members of the superfamily that later 
developed their own unique consensus 
sequences; or (2) the duplications 
occurred separately in each protein fam- 
ily and, owing to structural constraints, 
gave rise to similar, but unique, consen- 
sus sequences. These two schemes are 
not mutually exclusive and there is good 
reason to believe that the divergent and 
convergent scenarios have worked in 
conjunction to yield the LRR superfamily 
that exists today. The variation in;the 
length of LRRs among proteins and the 



presence of consensus sequences 
unique to a particular protein argue for a 
rich and complex evolutionary history 
that may have sprung from more than 
• one root. 

Three-dimensional structure of LRRs 

The recently determined crystal 
structure of porcine ribonuclease 
inhibitor protein (RI) 2 (Fig. 2a) has 
allowed us to examine the three- 
dimensional architecture of LRRs. RI is a 
cytoplasmic protein that inhibits ribo- 
nucleases from the pancreatic super- 
family by binding very tightly to an ex- 
tensive surface area of ribonuclease 
containing its catalytic site 4 . The se- 
quence of RI consists of 15 alternating 
LRRs of 28 (A-type) or 29 (B-type) resi- 
dues each (Fig. 2b). Ninety percent of 
the structure is built of these repeat 
units. The short amino- and carboxy- 
terminal segments display a weaker 
homology to the internal repeat motif. 

In the RI molecule, LRRs correspond 
to g-a structural units, consisting of a 
short p-strand and an a-heiix approxi- 
mately parallel to each other (Fig. 2). 
All repeats, including the terminal seg- 
ments, adopt very similar structures; 
All structural repeats contain 28 or 29 
residues, except the amino-terminal 
repeat, which contains 25 residues. 
The structural units are arranged so that 
all the p-strands and the helices are 
parallel to a common axis, resulting 
in a nonglobular, horseshoe-shaped 
molecule with a curved parallel p-sheet 
lining the inner circumference of 
the horseshoe, and the helices flanking 
its outer circumference. 
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Table f. Proteins with leuclne-rich repeats 



Protein (species) 0 



Function-Jigand" 



Location** Repeats' Length' Consensus sequence* 



PIR' entry 



RNase inhibitor (porcine) 

leucine-rich aZGP (human) 
RNAi (Saccfiarorm/ces cerevisiae) 
U2 snRNP A* (human) 
Blglycan (human) 

Decorin (human) 



fibromodulin (bovine) 

Lumican (chicken) 
Proteoglycafrtb (chicken) 
Osteoinductive factor (bovine) . 
Platelet GP Iba (human)' 
Platelet GPV (human) 
YopM {Yersinia pestis) 
lpaH7.8 (Shigella ftexnen) 
lpaH4.5 {Shigella flexnerf) 
Toll (Drosophila) 
Slit (Drosophila) 
Connectln {Drosophila) 
Ch'aoptin (Drosophila) 
Flightless-! (Drosophila) 
Oligodendrocyte myelin 6P 
(human) 
CD14 (human) 
Trk (human) 
TrkB (mouse) 

TrkC (porcine) 

TMK1 (Arabldopsis thaiiana) 

LH-CG receptof (rat) 

FSH receptor (rat) 

TSH receptor (dog) 

Adenylate cyclase (Saccharomyces 

cerevisiae) 
T-LR (Trypanosoma brucel) 
RADX (Saccharomyces cerevislae) 
RAD7 (Saccharomyces cerevlsiae) 
DRT100 {Arabidopsls thaiiana) 
GRR1 (Saccharomyces cerevislae) 
CCR4 (Saccharomyces cerevisiae) 
sds22 (Schizosaccharomyces 

pombe) 

p34 ribosorne-binding protein (rat) 



, RNase Inhlbitor-RNase 
?-? 

RNA processing-? 
Splicing-U2 snRNP 
ECM binding-laminin, 

fibronectln, TGF-0 
ECM binding-collagen, 

fibronectin, thrombospondin, 

TGFtf 

ECM binding-collagen, 
fibronectin 

Corneal transparency-? 
?-? 

Bone morphogenesis-BMP 
Cell adhesfon-vWF, thrombin 
Cell adheslon-GP IX, GP lb 
virulence factor-thrombln 
?-? 
?~? 

Embryo development-? 
Axon development-? 
Synapse development-? 
Photoreceptorcell development- 
Embryo development-? 
Myellnation-? 

Cell-surface receptor-tPS-iPB 
Receptor protein kinase-NGF 
Receptor protein kinase-BDNF, 
NT-3 

Receptor protein Wnase-NT-3 
Receptor protein kinase-? 
Signal transduction-LH, CG 
Signal transductioo-FSH 
Signal transduction-TSH 
Signal transductlon-RAS 

DNA repair^AOlO 
ONA repair-? 
Recombination-? 
Signal transduction-? 
Transcription-? 
Mitosis-dis2, sds21 

RM membranes-ribosome 



Cytoplasm 15 



28(A) 



Cartooxypeptidase N (human) Stabilization-catalytic subunit 
Internalln (Listeria monocytogenes) Invasion-? 
Inl8 (Listeria fnonocytogenes) ?-? 

LRR superfamlly 



Serum 


8 


24 


Cytoplasm 


8 


29 


Nucleus 


4 


24 




8 


24 


ECM 


10 


24 


COM 


11 


24 


ECM 


12 


24 


tOM 


6 


24 




6 


24 


PM (EC) 


7 


24 


PM(EC) 


14 


24 


IC + EC 


12 


20 




6 


20 


? 


8 


20 


PM (EC) 


19 


24 


EC 


19 


24 


PM (EC) 


7 


24 


-7 PM (EC) 


30 


24 


PM (EC) 


16 


23 


PM (EC) 


8 


24 


PM(EC) 


8 


27 


PM (EC) 


2 


23 


PM (EC) 


3 


23 


PM (EC) 


3 


23 




11 


23 * 


PM(EC) 


.5 


25 


PM(EC) 


7 


25 


PM(EC) 


6 


25 


PM 


20 


23 


(cytoplasm) 






? 


18 


23 


Nucleus 


3 


23 


? 


5 


26 


Chloropiast 


5 


24 


Cytoplasm 


9 


26 


? 


4 


23 


Nucleus 


11 


22 


RM membrane 


4 


' 24 


(cytoplasm) 






Plasma 


12 


24 


Cell wail 


13 


22 


? 


6 


22 
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A31857 

N6HUA2 
BVBYN1 
$03616 
A40757 

NBHUC8 



S05390 

A41748 
A41781 
A35272 
NBHUIA 

A33950 
A35149 
S1824B 
A29943 
A36665 
S28464 
A29944 

A34210 

TDHUM4 

TVHUTT 

S06943 

A40026 " 

JQ1674 

A41343 

A34548 

A40O77 

OYBY 

A36359 
DOBY01 
A25226 
A46260 
A41529 
S31286 
A38439 



A34901 
A39930 
C39930 



e A single example is given for proteins where sequences from several species are known. Proteins with less than two LRRs are not shown Abbreviations: 
RNase, ribonuclease; GP, glycoprotein; snRNP, small nuclear ribonucleoprotein particle; ECM, extracellular matrix; PM, plasma membrane* EC extracellular; 
TGF, transforming growth factor; IC, intracellular; BMP, bone^norphogenfc protein; vVyF, von Willebrand factor; LPS-LPB, complex of lipo polysaccharide and 
lipopofysacchande-binding protein; NGF, nerve growth factor; BDNF, brain-derived neurotrophic factor; NT-3, neurotrophirv3: LH. lutrophin- CG chorio- 
gonadotrophin; PSH, follitrophin; TSH, thyrotrophin; T-LR, trypanosomal leucine-rich protein; RM membrane, rough microsomal membrane. ' ' 

''Cellular distribution of the protein. For membrane-associated proteins, the location of the LRR domain is indicated in parentheses. 

'Total number of repeats is the number of occurrences of the a..a.a..N/C/T sequence, where 'a' represents A, V, l, I, F, Y or M - repeats shorter than 18 
residues and isolated single repeats were not counted. Only- the counted repeats were used to determine the consensus sequence. Because other authors may 
have used different criteria, the numbers of repeats, the lengths and the consensus sequences mayvary from those originally reported. 

■The most common length of all the repeats in the protein is given. For some proteins, there are significant deviations from this number. RNase Inhibitor is 
aligned so that two types of repeats (A-type and B-type) alternate in the sequence. 

•The numbers above the sequences indicate the positron in the repeat In reference to the consensus of porcine RNase inhibitor. One-letter code is used for 
amino acids. An amino acid Is included in the consensus if present at that position In more than half of the repeats; 'a' represents A V L I F Y or M and is 
included in the consensus If these amino acids ere present at that position in more than 80% of the repeats. Symbols used- ' ' any amino add* gap; 
'±\ amino acid may or may not be present at this position. ' ' 

'Protein Identification Resource sequence data bank. References for the proteins without PIR entries: Platelet <3V, Hickey, M J. et al (1993) Proc NaV Acad. 
*a/S °' ^ ^ (1 " 3) ^ USA 9 °' U386 - 11390: PS* *0MmrtMN Protein/Ohsuml, T. 
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Figure 1 

Schematic diagrams of the primary structures of proteins with leucinerrich repeats. For abbreviations and references, see Table I. 



The consensus residues of the 
repeats play structural roles. The 
sidechains of leucines and of other 
aliphatic residues (positions 2, 5, 7, 12, 
17, 20, 23 and 24), of cysteines {position 
17), and of asparagines or cysteines 
(position 10) form the core of the pro- 
tein [note that the consensus positions 
specific to.RI are slightly different from 
the overall consensus of LRR-containing 
proteins (Table I)|. The spacing of 
leucines is responsible for proper pack- 
ing of the 0«strands and a-helices. The 
remaining consensus residues in Rl 
have other roles. For example, in the 
B-type repeat, glycine at position 13 of 
the repeat seems to be a helix-initiation 
signal, while proline at position 27 is a 
helix-termination signal. Because of the 
arrangement pf the p-a units, side- 
chains of consensus residues align in 
stacks, both at the core and on the 
surface of the protein. 



It is unlikely that a single leucine-rich 
motif would adopt the conformation 
observed in the Rl protein. Neighboring 
repeats form many polar and nonpolar 
contacts; also, the p-strands tend to 
pack against the helices that precede 
and follow the p-strand in question. 
Synthetic peptides that correspond to a 
single repeat have low propensity to 
adopt a defined structure 5,6 . 

Secondary-structure predictions 
of LRRs show no clearly repetitive 
patterns 1 . Consequently, before the 
structure of Rl had been determined, 
both an a-helix 7 and a p-strand 
conformation 8 was proposed as the 
major secondary structural element 
of LRRs in different proteins, p-tums 
were, however, correctly predicted in 
the vicinity of the conserved 
asparagine 9 . 

Leu cine-rich repeats should not be 
confused with the more widely known 



leucine zippers. Leucine zippers are 
motifs found in oligomeric proteins, 
including many DNA-binding proteins 
such as the products of the c-fos and 
c-;im proto-oncogenes. Leucine zippers 
are built of heptad repeats with leucines 
present at every seventh position. These 
leucines form a hydrophobic ridge on 
one side of an a-helix that interacts with 
a complementary helix 10 . The only simi- 
larity between LRRs and leucine zippers 
is the conservation of leucines at par- 
ticular spacings; these leucines cannot 
usually be replaced by other hydropho- 
bic amino acids. However, the spacing 
between leucines differs between LRRs 
and leucine zippers, with only certain 
leucines in the carboxy-terminal part 
of LRRs spaced by seven residues. 
The function of leucines in LRRs and 
leucine zippers is also different; while 
leucines in leucine zippers participate in 
oligomerization, the buried leucines in 
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Figure 2 

Structure of ribonuclease inhibitor (Rl). (a) Ribbon diagram of the structure of porcine RL 

(b) Consensus sequences and secondary structure of the A- and B-type leucine-rich 
repeats {LRRs) of porcine Rl. One-letter amino acid code is used; Y indicates any amino 
acid, V denotes an aJiphatic amino acid. The part of the repeat that is strongly conserved 
throughout the LRR superfamily is underiined, and the conserved residues are bold. Below 
the sequence, elements of secondary structure are Indicated. The solid line marks the 
core region of the 0-sheet or «-heilx; the dots denote extensions of the helix in different 
repeats. The types of p-tums are shown below the corresponding i + 1 and 1 + 2 positions. 

(c) Stereo plot of the main chain of a representative leucine-rich repeat corresponding to 
residues 195-222 of porcine Rl. Sidechains of only the consensus residues are shown; 
bonds In the conserved residues are filied. 
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LRRs play a structural role and cannot 
be directly involved in protein-protein 
interactions. 

Rl as a model for other LRR-containfng 
proteins 

We believe that the repeats in Rl are a 
satisfactory structural model for LRRs in 
other proteins. Most consensus residues 
that play key structural roles are con- 
served throughout the superfamily, 
especially the consensus residues that 
cluster around the p-strand region (Table 
I, Fig. 2). Although the LRRs in Rl are 
among the longest found in the super- 
family, shorter repeats could easily adopt 
a similar p-d conformation, as dem- 
onstrated by the amino-terminal struc- 
tural repeat in Rl, which contains 25 
■residues. 

We must, however, allow the possi- 
bility that in certain LRR-containing pro- 
teins the helical parts might be substi- 
tuted with more extended structures. 
This would most probably occur in those 
proteins that contain shorter variants of 
LRRs which, in the carboxy-terminal 
regions of the repeats, lack the appro- 
priate pattern of hydrophobic residues 
expected for an amphipathic helix, and 
contain prolines and other amino acids 
with low propensities to form a-helices. 

Substitution of the helical part of the 
repeats with a more extended structure 
would influence the overall shape of the 
protein considerably, particularly its 
curvature. Two protein folds exist that 
could shed light on the plausible archi- 
tecture of a structure with p-strands sub- 
stituting for a-helix. These are the 
0-rolI folds of pectate lyase C 11 (PelC) 
and aikaJine protease 12 (AP) (Fig. 3). 
Both p-roll folds are assembled from 
repetitive structural units and expose a 
surface of a parallel (3-sheet to solvent 
In addition, in PelC, as in Rl, a ladder of 
asparagine residues at the carboxy- 
terminal ends of the p-strands hydrogen 
bonds to the backbone. The length of a 
structural repeat of the PelC-type 'P- 
helix' is 22 residues, and the length of a 
turn of the AP-type p-roll is 18 residues. 
Neither PelC nor AP contain LRRs. 

In proteins that contain over 21 LRRs 
and retain the curvature of the structure 
of Rl, the overall organization of the 
repeats must be different from that in Rl 
to prevent the termini from colliding. 
Such a collision could be avoided by a • 
twist in the sheet. Alternatively, smaller 
numbers of repeats could constitute 
separate domains, linked by segments 
adopting conformations different from 
the p~ct units. 
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LRR-containing proteins: functional and 
evolutionary similarities 

LRRs are found in a functionally and 
evolutionarily diverse set of proteins. 
Although few properties are shared 
among all members of the LRR super- 
family, two main themes emerge: all 
LRR-containing proteins appear to be 
involved in protein-protein interactions, 
and at least half of them take part in sig- 
nal transduction pathways (Table I). 
LRRs must therefore provide a structural 
skeleton for the achievement of diverse, . 
specific molecular interactions. 

A property of the structure of the LRR 
that may be helpful in protein binding Is 
its nonglobular shape. Because of this 
shape, the area available for interaction 
with smaller globular proteins is sub- 
stantially increased, facilitating more 
interactions, and thus enhancing affinity. 

Another property of the structure of 
LRRs that might be useful for protein 
binding is the exposed surface of the 
parallel p-sheet. This structural feature is 
rare in proteins, and has only been dis- 
covered recently in structures of PelC 11 , 
AP 12 and RI 2 . The p-roll structures of 
PelC and AP were proposed to be 
responsible for transport across mem- 
branes. However, such a surface feature 
could also aid in protein binding. 
Parallel p-sheets are common in the 
interior of proteins, where helices or 
other parallel p-sheets pack against 
them. In the case of LRR-containing 
proteins, the interaction with the 
Hgand could substitute for such packing 
interactions. 

In addition to the properties common 
to most LRR-containing proteins, such 
as involvement in protein-protein inter- 
actions and signal transduction, further 
similarities restricted to fewer members 
of the superfamily can be found, on 
the basis of both their functions and 
the characteristics of their repeats. 
These often point to evolutionary and 
functional relationships among LRR- 
containing proteins. The specificity of 
proteuirprotein interactions of LRR-con- 
taining proteins and, therefore, the basis 
of their functions are most probably due 
to the specific composition of noncon- 
sensus residues, and might also be influ- 
enced by the lengths of the repeats and 
the flanking domains. These functional 
and evolutionary relationships are dis- 
cussed below, and the remarkable 
diversity of functions of LRR-containing 
proteins is demonstrated. 

Relatives of RI. The closest known rela- 
tives of RI seem to be the RNA1 protein, 
which is involved in RNA processing 11 , 
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Figure 3 

Schematic diagrams of the repetitive structural units of (a) ribonuclease Inhibitor, (b) 
pectate lyase C, and (c) the p-ro!l domain of alkaline protease. 



and the A' protein from the U2 small 
nuclear ribonucleoprotein particle 
(snRNP), which is required for snRNP 
assembly 6 . RI and RNA1 also have the 
most closely related patterns of con- 
served residues among the LRR pro- 
teins. RI and A' protein both interact with 
RNA-binding proteins, and therefore 
may be functional relatives. 

Adhesive proteins represent the largest 
group in the LRR superfamily. These 
proteins all contain similar 24-residue 
LRRs. One family of adhesive LRR-con- 
taining proteins includes the small pro- 
teoglycans: biglycan, fibromodulin, 
decorin, lumican, proteoglycan-Lb and 
osteoinductive factor (OIF, also called 
osteoglycin) 9 . Small proteoglycans bind 
various components of the extracellular 
matrix and growth factors (Table I). 
Decorin and fibromodulin regulate col- 
lagen-fibril formation*; and OIF, in con- 
junction with the transforming growth 
factors TGF-p and TGF-02, induces bone 
formation 14 . 

Several adhesive LRR-containing 
proteins are found in Drosophila. Toll, 
slit, connectin, chaoptin and flightless-I 
appear to orient cells during develop- 
ment; each protein is specific to par- 
ticular tissues 1 *" 19 . Connectin, chaoptin 
and a similar mammalian protein that 
may be involved in myelination (oligo- 
dendrocyte-myeGn glycoprotein, OMGP) 20 
are attached to the membrane via a gly- 
cosylphosphatidylinositol (GPD anchor. 

Another family of adhesive proteins 
comprises the proteins of the Ib-V-IX 
system of platelet glycoproteins. This 
complex constitutes the receptor for von 
Willebrand factor (vWF) and mediates 
the adhesion of platelets to injured vas- 
cular surfaces 21 . All polypeptides in the 
complex contain LRRs, but glycopro- 
teins Ibp and IX contain only one 
leucine-rich motif. Although it is unlikely 



that a single motif by itself would nor- 
mally adopt the conformation observed 
in the crystal structure of RI, such a con- 
formation could be stabilized by other 
proteins in the complex. Bacterial pro- 
teins YopM and IpaH interfere with 
platelet-mediated inflammatory re- 
sponses, supposedly owing to homology 
with glycoprotein lba 22 . 

.Signal-transducing receptors. The LRR 
surjerfamily contains several families of 
signal-transducing receptors. CD 14 is a 
GPI-anchbred receptor that, upon binding 
the complex between bpopofysaccharide 
and its binding protein, triggers intracellu- 
lar protein tyrosine phosphorylation, 
which in turn leads to an antibacterial 
response by macrophages 23 . 

The proto-oncogene trk encodes a 
transmembrane protein, Trk, with a 
cytoplasmic tyrosine kinase domain 24 . 
The extracellular domain that contains 
LRRs is involved in high-affinity binding 
of nerve growth factor; other proteins 
from the Trk family selectively bind 
other neurotrophins (Table I). TMK1, a 
plant protein with structural and bio- 
chemical properties similar to the re- 
ceptor protein kinases, contains serine/ 
threonine kinase in its cytoplasmic 
domain 25 . The receptor protein kinases 
are related to adhesive proteins such as 
Toll and glycoprotein lb, as they are tar- 
geted to the plasma membrane and 
contain an extracellular LRR domain, a 
transmembrane domain and a carboxy- 
terminal cytoplasmic domain involved in 
signal transduction. 

G-protein-coupled gonadotropin 
receptors consist of an extracellular, 
hormone-binding LRR domain, a domain 
containing seven membrane-spanning 
segments and a short cytoplasmic tail 26 . 
Their ligands, chorionic gonadotropin, 
luteinizing hormone, follicle-stimulating 
hormone and thyroid-stimulating 
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hormone, are heterodimeric glycopro- 
teins composed of a specific p-subunit 
noncovalently attached to a common a- 
subunit 

Other signal-transducing proteins. Yeast 
adenylate cyclase contains LRRs in a 
domain that is important for the regu- 
lation of its catalytic activity by the RAS 
protein 27 . The trypanosomal leucine-rich 
protein may be related to this domain 
because it is encoded in the variant- 
surface-glycoprotein expression site 
adjacent to an adenylate cyclase 28 . 

Two LRR-containing proteins take 
part in the glucose repression pathway 
of yeast. GRR1 is the primary response 
element in this pathway 29 , and CCR4 is 
required for the transcription of glucose- 
repressible alcohol dehydrogenase 30 . 

Other proteins. RAD1 and RAD 7 consti- 
tute another functional group of LRR- 
containing proteins, as they are both 
involved in nucleotide excision repair in 
yeast RAD1 forms a functional complex 
with RADIO, but the binding region lies 
outside the LRR-containing sequence 31 . 
Genetic evidence suggests that RAD7 
may interact with RAD23 (Ref. 32). 

RAD1 also plays a role in mitotic 
recombination 33 . Similarly, DRT100, a 
plant cDNA encoding a protein lhat 
contains LRRs and a micleotide-binding 
motif, complemented deficiencies in 
homologous recombination in Escher- 
ichia co// M . 

The remaining LRR-containing pro- 
teins thus far appear unrelated to other 
members of the superfamily, but also 
have important functions: sds22 regu- 
lates protein phosphatase- 1 function in 
fission-yeast mitosis 35 ; the ribosome- 
binding protein p34 may function as 
the ribosome receptor in the secretory 
pathway 36 ; the two LRR-containing 
subunits of the carboxypeptidase N 
tetramer stabilize the catalytic subunits 
and keep them in circulation 37 ; and 
intemaJin confers invasive properties oh 
Listeria 3 *. 

Flanking cysteine clusters. Many LRR- 
containing proteins also contain 
homologous regions flanking the LRR 
domains (Fig. 1). These. regions are 
characterized by four similarly spaced 
cysteines in a. stretch of about 20 
amino acids for the ammo-flanking 
region and about 50 amino acids 
for the carboxy-flanking region 16 - 39 ; 
the consensus sequences can be 
described as CP(~2x|CxC|~6xlC 
for the amino-flanking and PxxCxC 
(~20xlC[-20xlC for the carboxy- 
flanking regions (x denotes any amino 
acid). The amino-flanking cysteine 
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clusters occur in all small proteoglycans, 
glycoproteins Ibp and IX, Toll, slit, 
. OMGP, CD 14, the Trk family and 
gonadotropin receptors; related se- 
quences without all four cysteine 
residues conserved are present in 
leucine-rich a2-glycoprotein, glycopro- 
teins Iba and V, lpaH4.5, chaoptin and 
carboxypeptidase N. The carboxy- 
flanking cysteine clusters occur in glyco- 
proteins Ibct, Ibp, V and IX, Toll, slit and 
the Trk family; related sequences in 
which all four cysteines are not con- 
served are present in leucine-rich a2- 
glycoprotein and connectin. A different 
type of carboxy-flanking sequence con- 
taining two cysteines is found in small 
proteoglycans 39 . The flanking cysteine 
clusters appear to be a property of ad- 
hesive proteins and receptors. 

LRRs with known function. In several 
cases, LRRs are responsible for specific 
functions. In RI, RNA1, small proteogly- 
cans, chaoptin, CD 14 and sds22, the tan- 
dem repeats comprise essentially the 
entire primary structure. In glycoprotein 
Iba, the binding sites for vWF and 
thrombin have been mapped to the 
LRR-containing domain; although LRRs 
do not appear to bind. vWF directly 21 , 
point mutations in the LRR domain 
cause variants of Bernard-Soulier 
disease, a bleeding disorder (see, for 
example, Ref. 40). LRRs appear to be 
responsible for the regulation of yeast 
adenylate cyclase by RAS 27 , for the hor- 
mone selectivity of the gonadotropin 
receptors 41 and for the function of CCR4 
(Ref. 30). The single leucine-rich motif in 
the type 1 human immunodeficiency 
virus Rev protein is the frans-activating 
region of the virus 42 . 

Concluding remarks 

LRR-containing proteins represent 
an important superfamily involved in 
protein-protein interactions and signal 
transduction: The wide range of func- 
tions of LRR-containing proteins suggests 
that LRRs are generally useful protein- 
binding motifs; the variability of primary 
structures of these proteins suggests that 
these proteins do not necessarily share a 
common ancestor. The crystal structure 
of RI provides a model for the architec- 
ture of LRRs and helps us understand 
why LRRs may be useful for protein 
binding; the two structural features that 
might be responsible for the binding 
function are the solvent-exposed parallel 
0-sheet and the nonglobular shape of 
the structure. Owing to the variability of 
repeats and functional diversities of 
LRR-containing proteins, however, the 
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crystal structure of RI can only represent 
a platform for further structural studies 
of these proteins. 
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Aquaporins: water channel 
proteins of plant and animal 
cells 

Maarten J. Chrispeels and Peter Agre 

Certain biological membranes, such as the erythrocyte plasma mem- 
brane, have a high osmotic water permeability, and such membranes have 
long been suspected of harboring water.channels. The molecular identity 
of these channels has now been established with the purification of 
water-channel proteins and the cloning of the genes encoding them. 
Homologous water-channel proteins, called 'aquaporins', are present in 
plants and animals. These channels are water selective and do not allow 
ions or metabolites to pass through them. Their discovery is providing 
new insights into how plant and animal cells facilitate and regulate the 
passage of water through their membranes. 



CELLS ARE COMPOSED primarily of 
water, and movement of water into and 
out of cells is a key feature of numerous 
physiological Functions in animals, plants 
and microorganisms. The plasma mem- 
branes of renal proximal tubules, certain 
other mammalian epithelia and erythro- 
cytes are highly permeable to water, 
which moves in the direction of an 
osmotic gradient (see Refs 1, 2 for 
reviews). In plants, water moves through 
the living tissues in response to hydro- 
static and osmotic forces. During the 
day, when the stomata are open, a 
constant transpiration stream moves 
through the tissues; water enters the 
roots and moves up the conductive tis- 
sues of the stem, and water vapor exits 
the leaves. 

Numerous studies on the water per- 
meability of mammalian epithelia and 
the hydraulic conductance of plant tis- 
sues and cells have been made, and the 
plasma membranes are thought to be 
the major barriers to water flow. The 
lipid bilayer is somewhat permeable to 
water resulting from simple diffusion, 
and the diffusional water permeability 
coefficient (P d ) of many membranes is 
<0.01 cm sr 1 . This process cannot be in- 
hibited pharmacologically and is charac- 
terized by a high Arrhenius activation 
energy (E a > 10 kcal moH, i.e. reduced 
water movement at lower temperatures 
where lipid packing is tighter). Pores or 
water channels were hypothesized to 
explain the more rapid transmembrane 
water movements characteristic of eryth- 
rocytes, certain mammalian epithelia 
and plant-tissues. Unlike diffusional water 
permeability, channel-mediated water 
movement across erythrocyte mem- 
branes is reversibry inhibited by mercuric 
chloride and exhibits a low Arrhenius 
activation energy. (E d < 5 kcai moH). . 
The osmotic water permeability 
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coefficient (P r ) of erythrocyte mem- 
branes measures approximately 0.02 cm 
sec 1 . Although erythrocyte membranes 
and certain other biological membranes 
have P f >> P d , suggesting the existence 
of water channels, the molecular identity 
of the water channels remained elusive 
until the recent identification of CHIP 
and a growing list of evolulionarily re- 
lated water-channel proteins in animals 
and plants. 

An ancient family of channel proteins 

The aquaporins are a functionally 
defined group of water-transport pro- 
teins that are part of an ancient family of 
channel proteins called the MIP family, 
after the first identified member, major 
intrinsic protein of the mammalian lens. 
MIP comprises approximately 50% of the 
total membrane protein of lens-fiber 
cells, and the cDNA encoding it was 
isolated from a bovine lens library 4 . 
Hydropathy analysis of the deduced 
amino acid sequence predicted a 
263-residue integral membrane protein 
with cytoplasmic amino and carboxyl 



termini and six bilayer-spanning 
domains. The amino- and carboxy- 
terminal halves of MIP are themselves 
sequence related, forming a unique struc- 
ture in which the topology of the first and 
second halves of the molecule are oriented 
at 180° to each other (Fig. 1); the MIP 
family already has more than 20 mem- 
bers, including: NOD26, a protein in the 
symbiosome membrane of soybean root 
nodules; the glycerol facilitator of 
Escherichia coli and other bacteria; a sup- 
pressor of a Saccharomyces cerevisiae 
growth defect on fermentable sugars; a 
developmentally regulated Drosophila 
brain protein; and multiple tonoplast 
intrinsic proteins in plants (Ref. 5 and ref- 
erences therein). The functions of many 
MIP family members have yet to be 
established, but several plant and ani- 
mal MlP-family members have been 
shown to be water-specific membrane 
channels. 

Structure of the aquaporins 

As with MIP, the 28kDa channel- 
forming integral protein (CHIP) was first 
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8-11 November 1994 

Polymerase Chain Reaction 
(PGR) Applications/Cycle DNA 
Sequencing, Maryland, USA. 
(ATCC/Workshop Manager, 1 230 I 
Parklawn Drive. Rockviile, MD 
20852. USA. Tel; +1 301 231 5566. 
Fax: +1 3017701805.) 

8-21 November 1994 

Molecular Markers for Plant 
Breeding and Plant Genetics - 
Course, Cold Spring Harbor, NY, 
USA. (Meetings Office, Cold 
Spring Harbor Laboratory, 1 
Bungtown Road, Cold Spring 
Harbor, NY 11724-2213. USA.) 

8*21 November 1994 

Monoclonal Antibodies from 
Combinatorial Libraries — 
Course, Cold Spring Harbor, NY, 
USA. (Meetings Office, Cold 
Spring Harbor Laboratory, 1 
Bungtown Road, Cold Spring 
Harbor, NY 1 1724-2213. USA) 

14- 18 November 1994 
TJlstrastructural 
Immunocyto chemistry 
Course, Sutton, UK (Royal 
Microscopical Society, 37-38 St 
Clements. Oxford. UK OX4 lAf.) 

15- 18 November 1994 * 

Biopolymers and Byproducts; 
Structure, Function and 
Applications, Bangkok, 
Thailand (Prof. Jisnuson Svasti. 
Department of Biochemistry, 
Faculty of Science. Mahidol 
University, Rama VI Road, 
Bangkok 10400. Thailand. Tel: 
+66 2 2461360 to 69, ext 
430S/4306. Fax: +66 2 2480375 
or2S96634.) 

16- 18 November 1994 
Cell Culture and 
Hybxidomas: Quality Control 
and Cryop reservation 
Techniques, Maryland. USA. 
(ATCC/Workshop Manager, 
12301 Parklawn Drive, Rockviile, 
MD 20852. USA Tel: 

+ 1 301 331 5566. Fax: +1 301 770 
1805.) 



16-20 November 1994 * 
Immunobiology of Proteins 
and Peptides, Rio Rico, AZ, USA. 
(Dr Garvin S. Bixler, Jr. Univax 
Biologies Inc., 1 2280 Wilkins 
Avenue, Rockviile. MD 20852, 
USA. Tel: +1 301 7703099. 
Fax: +1 301 770 2014.) 

20-22 November 1994 ★ 
International Symposium on 
Genomic Imprinting/, Florence, 
Italy. (Pro! M. U Giovannucci 
Uzielli, Human Genetics Service, . 
University of Florence. Via 
Masaccio 209. Firenze 50132, 
Italy. Tel: +39 55 5662 931. " 
Fax: +3955 5662 918.) 

28-29 November 1994 
Adhesion Proteins, Paris, 
France. (Prof. B. Boris Vargafiig. 
Institut Pasteur, 28 rue du Dr 
Koux, 75015 Paris. France. Tel: 
+33 1 4568 8682/8688. Fax: 
+33 1 4568 8703.) 

1-2 December 1994 
Cycling out of Control: the 
Cell Cycle in Growth, 
Differentiation and Disease, 

Paris, France. (Pippa Burnett. 
Nature, Macnullan Magazines Ltd, 
4 little Essex Street, London, UK 
WC2R3LF. Tel: +44 (0)71 836 
663. Fax: +44 (0)71 279 5417.) 

7-9 December 1994 
IBC: 5th Annual International 
Conference, Lajolla, CA, USA. 
• (IBC USA Conferences. 225 
Turnpike Road. Southborough, 
MA01772-1647.USA.Tel: +1 508 
4818400. Fax: +1 508 481 
7911.) 

12-14 December 1994 * 
Molecular Biology and 
Cancer Network; 1 1th Annual 
Meeting, Warwick. UK. (EC 
Weston. Institute of Cancer 
Research, Fulham Road, London, 
UK SW3 6JB. Fax: +44 (0)71 352 
3299.) 



14-16 December 1994 
An Introduction to Genetic 
Engineering, Coventry, UK. 
(Carol Howes. Dept of Biological 
Sciences, University of Warwick. 
UK CV4 7AL,Tet+44 203 
523517. Fax:+44 203 523568.) 

15 December 1 994 i 
Cell Cycle Control and 
Ap op to sis: Basic Mechanisms 
and Implications for Treating 
Malignant Disease, London, 
UK (Mrs J. Weniey, Symposium 
Administrator, Department of 
Molecular Pathology. UCL 
Medical School. Windeyer 
Building, 46 Cleveland Street, 
London, UK W1P 6DB. Tel: + 07 1 
380 9343. Fax: +44 71 367 3310.) 

$-11 January 1995 
Oncogenes: 20 Years Later, 

Keystone, CO, USA. (Keystone 
Symposia. Drawer 1630, Silver- 
thome. CO 80498. USA. Tel: 
+ 1 303 262 1230.) 

26-28 January 1995 * 
Ions and Drugs: Active 
Transport inEukaryotic Cells, 

Villard de Lans. France. (Dr Yves 
Dupont. DBMS/BMC. Grenoble, 
France. Tel: +33 76 88 46. Fax: 
+33 76 88 S4 87.) 

5- 9 February 1995 * 

Gordon Research Conference 
on Carotenoids, Ventura. CA, 
USA (James A. Olson. Towa State 
University. Biochemistry and 
Biophysics, Ames, IA 5001 1, USA. 
Tel: + 1 5 1 5 294 3068. Fax: + 1 51 5 
2944141.) 

6- 7 April 1995 * 

BSCB/British Connective 
Tissue Society - Molecular 
and Cell Interactions in 
Connective Tissue, 

Manchester, UK. (Karl Kadler and 
Martin Humphries, University of 
Cambridge, Tennis Court Road, 
Cambridge, UK CB2 1QP. Fax: 
+ 44 61 275 5082.) 
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18-22 April 1995 * 
BSCB/BSDB Spring Meeting - 
Stem Ceils, Canterbury, UK 
(Enrico Coen and Fiona Watt, 
CRC/Welicome, Cambridge. 
UK) 



18-22 April 1995 * 
BSCB Spring Meeting - 
Organization of the Cell 
Nucleus, Canterbury. UK. (Ron 
Laskey, Tony Kouzarides and 
Colin Dingwall. CRC/WeUcome. 
Cambridge, UK) . 

1 8-22 April 1995 
BSCB Spring Meeting - 
Workshop on Muscle Cells, 

Canterbury, UK (Simon Hughes 
and kob Simmons. BSCB/BSDB, 
University of Kent, Fax: +44 227 
763912.) 



18-22 April 1995 * 
BSCB Spring Meeting - Cell 
Movements in Development, 

Canterbury. UK, (Anthony Baines 
(BSCB) and Clive Wilson (BSDB). 
Fax: +44 227 763912.) 

22-24 May 1995 * 
22nd Stadler Genetics 
Symposium: All Aspects of 
Genome Analysis Including 
Plant and Animal, Columbia, 
MO. USA. (J. P- Gusta&on, 206 
Curtis Hall University of Missouri- 
Columbia. Columbia. MO 6521 1 . 
USA.) . 

18-23 August 1996 * 
9th International Congress of 
Human Genetics, Rio de 

Janeiro. Brazil. (Dr Henrique 
Krieger. Secretary General 9th 
International Congress of Human 
Genetics, Instituto de Ciencias 
Biomedicas. Universidade de Sac- 
Paulo, Cx. Postal 66208, 05389- 
970 Sao Paulo. SP-Brazil. Tel: 
+55 11 818 7328. Fax: +55 11 818 
7417 or +55 11 210 4817. Email: 
9ichg@biomed.icb2.usp.br) 
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